Overview

Dataset Statistics

Number of Variables 11
Number of Rows 582
Missing Cells 4
Missing Cells (%) 0.1%
Duplicate Rows 13
Duplicate Rows (%) 2.2%
Total Size in Memory 80.5 KB
Average Row Size in Memory 141.7 B
Variable Types
  • Numerical: 9
  • Categorical: 2

Dataset Insights

Total_Bilirubin is skewed Skewed
Direct_Bilirubin is skewed Skewed
Alkaline_Phosphotase is skewed Skewed
Alamine_Aminotransferase is skewed Skewed
Asparate_Aminotransferase is skewed Skewed
Albumin_Globulin_Ratio is skewed Skewed
Dataset has 13 (2.23%) duplicate rows Duplicates
Target has constant length 1 Constant Length

Variables


Age

numerical

Approximate Distinct Count 72
Approximate Unique (%) 12.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9312
Mean 44.7113
Minimum 4
Maximum 90
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Age is skewed left (γ1 = -0.0263)

Quantile Statistics

Minimum 4
5-th Percentile 18
Q1 33
Median 45
Q3 57.75
95-th Percentile 72
Maximum 90
Range 86
IQR 24.75

Descriptive Statistics

Mean 44.7113
Standard Deviation 16.1819
Variance 261.8546
Sum 26022
Skewness -0.02632
Kurtosis -0.5611
Coefficient of Variation 0.3619

Gender

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory Size 40440
  • The largest value (Male) is over 3.13 times larger than the second largest value (Female)

Length

Mean 4.4845
Standard Deviation 0.8576
Median 4
Minimum 4
Maximum 6

Sample

1st row Male
2nd row Male
3rd row Male
4th row Male
5th row Male

Letter

Count 2610
Lowercase Letter 2028
Space Separator 0
Uppercase Letter 582
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Male, Female) take over 50.0%
  • The largest value (male) is over 3.13 times larger than the second largest value (female)

Total_Bilirubin

numerical

Approximate Distinct Count 113
Approximate Unique (%) 19.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9312
Mean 3.3033
Minimum 0.4
Maximum 75
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Total_Bilirubin is skewed right (γ1 = 4.8908)

Quantile Statistics

Minimum 0.4
5-th Percentile 0.6
Q1 0.8
Median 1
Q3 2.6
95-th Percentile 16.375
Maximum 75
Range 74.6
IQR 1.8

Descriptive Statistics

Mean 3.3033
Standard Deviation 6.2139
Variance 38.6129
Sum 1922.5
Skewness 4.8908
Kurtosis 36.7771
Coefficient of Variation 1.8811
  • Total_Bilirubin is not normally distributed (p-value 3.1184289233720933e-24)
  • Total_Bilirubin has 84 outliers

Direct_Bilirubin

numerical

Approximate Distinct Count 80
Approximate Unique (%) 13.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9312
Mean 1.4885
Minimum 0.1
Maximum 19.7
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Direct_Bilirubin is skewed right (γ1 = 3.2011)

Quantile Statistics

Minimum 0.1
5-th Percentile 0.1
Q1 0.2
Median 0.3
Q3 1.3
95-th Percentile 8.4
Maximum 19.7
Range 19.6
IQR 1.1

Descriptive Statistics

Mean 1.4885
Standard Deviation 2.8103
Variance 7.8979
Sum 866.3
Skewness 3.2011
Kurtosis 11.2217
Coefficient of Variation 1.888
  • Direct_Bilirubin is not normally distributed (p-value 1.0982546291095512e-23)
  • Direct_Bilirubin has 81 outliers

Alkaline_Phosphotase

numerical

Approximate Distinct Count 263
Approximate Unique (%) 45.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9312
Mean 290.7543
Minimum 63
Maximum 2110
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Alkaline_Phosphotase is skewed right (γ1 = 3.7519)

Quantile Statistics

Minimum 63
5-th Percentile 137
Q1 175.25
Median 208
Q3 298
95-th Percentile 698.55
Maximum 2110
Range 2047
IQR 122.75

Descriptive Statistics

Mean 290.7543
Standard Deviation 243.1089
Variance 59101.9516
Sum 169219
Skewness 3.7519
Kurtosis 17.5572
Coefficient of Variation 0.8361
  • Alkaline_Phosphotase is not normally distributed (p-value 2.097342879347584e-15)
  • Alkaline_Phosphotase has 66 outliers

Alamine_Aminotransferase

numerical

Approximate Distinct Count 152
Approximate Unique (%) 26.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9312
Mean 80.8247
Minimum 10
Maximum 2000
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Alamine_Aminotransferase is skewed right (γ1 = 6.5271)

Quantile Statistics

Minimum 10
5-th Percentile 15
Q1 23
Median 35
Q3 60.75
95-th Percentile 232
Maximum 2000
Range 1990
IQR 37.75

Descriptive Statistics

Mean 80.8247
Standard Deviation 182.7577
Variance 33400.3754
Sum 47040
Skewness 6.5271
Kurtosis 50.0523
Coefficient of Variation 2.2612
  • Alamine_Aminotransferase is not normally distributed (p-value 1.414934069878542e-23)
  • Alamine_Aminotransferase has 73 outliers

Asparate_Aminotransferase

numerical

Approximate Distinct Count 177
Approximate Unique (%) 30.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9312
Mean 110.0687
Minimum 10
Maximum 4929
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Asparate_Aminotransferase is skewed right (γ1 = 10.5112)

Quantile Statistics

Minimum 10
5-th Percentile 15.05
Q1 25
Median 42
Q3 87
95-th Percentile 400.95
Maximum 4929
Range 4919
IQR 62

Descriptive Statistics

Mean 110.0687
Standard Deviation 289.1419
Variance 83603.0245
Sum 64060
Skewness 10.5112
Kurtosis 149.3867
Coefficient of Variation 2.6269
  • Asparate_Aminotransferase is not normally distributed (p-value 9.567050572183426e-25)
  • Asparate_Aminotransferase has 66 outliers

Total_protiens

numerical

Approximate Distinct Count 58
Approximate Unique (%) 10.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9312
Mean 6.4826
Minimum 2.7
Maximum 9.6
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Total_protiens is skewed left (γ1 = -0.2833)

Quantile Statistics

Minimum 2.7
5-th Percentile 4.605
Q1 5.8
Median 6.6
Q3 7.2
95-th Percentile 8.1
Maximum 9.6
Range 6.9
IQR 1.4

Descriptive Statistics

Mean 6.4826
Standard Deviation 1.0863
Variance 1.1801
Sum 3772.9
Skewness -0.2833
Kurtosis 0.2156
Coefficient of Variation 0.1676
  • Total_protiens is not normally distributed (p-value 0.0002735668434329255)
  • Total_protiens has 8 outliers

Albumin

numerical

Approximate Distinct Count 40
Approximate Unique (%) 6.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 9312
Mean 3.1416
Minimum 0.9
Maximum 5.5
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Albumin is skewed left (γ1 = -0.0425)

Quantile Statistics

Minimum 0.9
5-th Percentile 1.8
Q1 2.6
Median 3.1
Q3 3.8
95-th Percentile 4.395
Maximum 5.5
Range 4.6
IQR 1.2

Descriptive Statistics

Mean 3.1416
Standard Deviation 0.7962
Variance 0.6339
Sum 1828.4
Skewness -0.04253
Kurtosis -0.399
Coefficient of Variation 0.2534

Albumin_Globulin_Ratio

numerical

Approximate Distinct Count 69
Approximate Unique (%) 11.9%
Missing 4
Missing (%) 0.7%
Infinite 0
Infinite (%) 0.0%
Memory Size 9248
Mean 0.9471
Minimum 0.3
Maximum 2.8
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Albumin_Globulin_Ratio is skewed right (γ1 = 0.9882)

Quantile Statistics

Minimum 0.3
5-th Percentile 0.5
Q1 0.7
Median 0.94
Q3 1.1
95-th Percentile 1.5
Maximum 2.8
Range 2.5
IQR 0.4

Descriptive Statistics

Mean 0.9471
Standard Deviation 0.3199
Variance 0.1023
Sum 547.45
Skewness 0.9882
Kurtosis 3.232
Coefficient of Variation 0.3377
  • Albumin_Globulin_Ratio is not normally distributed (p-value 1.3084558849443233e-10)
  • Albumin_Globulin_Ratio has 10 outliers

Target

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory Size 38412
  • The largest value (1) is over 2.49 times larger than the second largest value (2)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 1
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 582
  • The top 2 categories (1, 2) take over 50.0%
  • The largest value (1) is over 2.49 times larger than the second largest value (2)
  • Target has words of constant length

Interactions

Correlations

Missing Values